EN FR
EN FR
STARS - 2014
Overall Objectives
New Software and Platforms
New Results
Bilateral Contracts and Grants with Industry
Bibliography
Overall Objectives
New Software and Platforms
New Results
Bilateral Contracts and Grants with Industry
Bibliography


Section: New Results

Head Detection Using RGB-D Camera

Participants : Marine Chabran, François Brémond.

keywords: RGB-D camera analysis, head detection, serious games

The goal of this work is to improve a head detection algorithm using RGB-D sensor (like a Kinect camera) for action recognition as part of a study of autism. The psychologists want to compare the learning process of children with autism syndrome depending on games (digital or physical toys).

The algorithm described in [79] represents a head by its center position. It takes three steps to determine this point :

  • Determine possible head center positions using a head model : inner circle radius=6 cm, outer circle radius=20 cm (Figure 13 ).

    A good inner point is a point on the inner circle verifying :

    d e p t h H e a d C e n t e r + 30 c m > d e p t h I n n e r P o i n t > d e p t h H e a d C e n t e r - 30 c m .

    A good outer point is a point on the outer circle verifying :

    d e p t h H e a d C e n t e r < d e p t h O u t e r P o i n t + 15 c m .
  • Merge close head centers separated by less than 4 pixels.

  • Select final head center according to its score (calculated according to the number of good inner and outer points).

Figure 13. Each circle is divided in n parts (n=8). The points on the inner circle must have a similar depth with the center point, the points on the outer circle must be further than the center point compared to the camera
IMG/InnerOuter.jpg

For now, it works well within video where people are close to the camera (about 1 meter) and without any background just behind them (Figure 14 ).

The problem is when the person is sitting and the head is ahead of the body (Figure 15 ) or close to a wall, the difference between head depth and outer circle depth becomes not sufficient (about 10 cm).

We have evaluated the performance of this algorithm with two data sets (Table 1 ). For Lenval Hospital data set, we have evaluated 2 series of 200 frames, for the Smart Home data set, we have evaluated 3 series of 300 frames (a total of 1300 heads).

Table 1. Performance of head detection and people detection on two different data sets.
Videos Head Detection (%) People detection (%)
Lenval Hospital dataset (Figure 14 ) 89.7 96.9
Rest home dataset (Figure 15 ) 62.8 85.3
Figure 14. Result of head detection - good detection. The bounding box represents the person, the small blue circle represents the head center.
IMG/goodDetectionBlur.jpg IMG/goodDetectionDepth.jpg
Figure 15. Result of head detection - wrong detection. The green bounding box represents the person, the small blue circle represents the head center.
IMG/headDetection.jpg IMG/headDetectionDepth.jpg